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ABSTRACT 

We study the free energy landscape of the small peptide Met-enkephalin. Our data 
were obtained from a generalized-ensemble Monte Carlo simulation taking the interactions 
among all atoms into account. We show that the free energy landscape resembles that 
of a funnel, indicating that this peptide is a good folder. Our work demonstrates that 
the energy landscape picture and folding concept, developed in the context of simplified 
protein models, can also be used to describe the folding in more realistic models. 
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INTRODUCTION 



It is well known that a large class of proteins folds spontaneously into unique, globular 
shape However, the mechanism of protein folding has remained elusive. To describe 
many biochemical processes it may be sufficient to assert that folding occurs on a time scale 
no slower than protein biosynthesis, and that the information required to find the precise 
three-dimensional shape is contained in the one- dimensional sequence of the molecule. 
This simple description may not be sufficient if the prediction of protein structure from 
sequence and the design of truly novel protein-like molecules are to be achieved. In order 
to answer the practical questions of structure prediction and design, it seems that one 
must go a considerable distance beyond this phenomenology — a new viewpoint may be 
required. Such a new viewpoint is emerging from the analytical and numerical studies of 
minimal protein models by several groups (see for example refs. [0-0). Its framework 
is provided by energy landscape theory and the funnel concept, which assert that a full 
understanding of the folding process requires a global overview of the landscape. The 
folding landscape of a protein resembles a partially rough funnel riddled with traps where 
the protein can transiently reside. There is no unique pathway but a multiplicity of 
convergent folding routes towards the native state 0-0. 

The importance of a funnel landscape can be seen by contrasting random heteropoly- 
meric molecules and proteins. Both random heteropolymers and proteins have an under- 
lying driving force to collapse, and for both molecules the various competing interactions 
within the molecule and between the molecule and the surrounding solvent lead to a 
rugged energy landscape. However, unlike random heteropolymers, proteins adopt well- 
defined three-dimensional structures because there is a sufficient overall slope of the energy 
landscape so that the numerous valleys flow in a funnel toward the native structure. 0, ||, 

The essence of the funnel landscape idea is competition between the tendency towards 
the folded state and trapping due to ruggedness of the landscape. This competition is 
measured by the ratio between the folding temperature (Tf) and the glass temperature 
{T g ). Good folding protein sequences fold rapidly on minimally frustrated landscapes 
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with large values of 7//T 9 j5|]-0]. Minimally frustrated sequences not only fold fast at 
relevant temperatures but are also robust folders, and therefore only weakly dependent 
on minor variations of the folding environment or to mutations. Energy landscape theory 
suggests a diversity of folding scenarios that have been explored by computer simulations 
of minimalist models (see for example references in [§, [Tj, an< ^ connections to 

studies of real proteins were proposed | 



A quantitative understanding of how the general parameters of the landscape relate 
to particular properties of a protein's sequence or final folded topology requires detailed 
molecular (and therefore less coarse-grained) calculations of folding free energy landscapes. 
Unfortunately, simulations of more realistic models of proteins where the interactions 
among all atoms in a protein are taken into account have been notoriously difficult (see 



for example Ref. [19] for a review). These studies involve extremely intensive numerical 
calculations, and thus the number and size of systems that can be explored is limited. 
In addition, it is difficult for such simulations to provide a direct view of protein folding 
dynamics. Because of the rough energy landscape, simulations based on canonical Monte 
Carlo or molecular dynamics techniques will get trapped at low temperatures in one of the 
multitude of local minima separated by high energy barriers. Hence, there is always the 
danger that only small parts of configuration space are sampled and physical quantities 
cannot be calculated accurately. 

New numerical approaches have been developed to deal with these sampling difficulties. 
Generalized-ensemble techniques (for a recent review, see, for instance, Ref. [Q) like 



multicanonical algorithms [^TJ and simulated tempering p2| allow an efficient sampling of 
low-energy configurations, and calculation of accurate low-temperature thermodynamic 
quantities became feasible. The first application of one of these techniques to the protein- 
folding problem is given in Ref. ||23|| . By comparing with recent experiments the usefulness 
of the approach was extensively tested and demonstrated in a study of helix-coil transitions 
of homo-oligomers of nonpolar amino acids |24|, [25|] and, more recently, of the C-peptide of 
ribonuclease A [p6| , [27] . A numerical comparison of three different generalized-ensemble 
algorithms can be found in Ref. |28| . 

Even with these new and sophisticated techniques, there are still many concerns and 
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difficulties in fully exploring the energy landscape of even medium-sized proteins, and for 
this reason we have restricted the analyses to only small peptides in this paper. 

In general, such peptides are too small to fold and therefore not useful as a model for 
the folding process in larger proteins. However, there are a few small peptides known which 
exist at low temperatures in stable defined conformations. For one of these peptides the 
characteristic temperatures of folding were recently determined numerically in Ref. p9| . 
These temperatures were shifted to lower values than expected for larger proteins, but 
otherwise the results of that study indirectly support the energy landscape picture and 
funnel concept. Here, we re-examine our data to directly investigate the free energy 
landscape of the peptide. In that respect, our work is similar to the earlier work in 
ref. |3(| but uses this new sampling technique. 

Our system of choice is the linear peptide Met-enkephalin which has the amino-acid 
sequence Tyr-Gly-Gly-Phe-Met. The experimental studies of this penta-peptide |3l|-p3| 
were mainly motivated by its biological significance as a neurotransmitter. Compact and 



defined structures were observed for membrane bound molecules | 32| , in crystals |3T|| or 
organic solvents. However, studies of this peptide in aqueous solution indicate that it 
exists at room temperature as an ensemble of extended coil structures with low frequency 
of folded structures |33| . 

Since the energy landscape is a function of a large number of degrees of freedom, it 
is obviously impossible to keep track of all coordinates. Hence, to actually observe the 
folding funnel of the peptide, one has to study a projection of the landscape onto a set 
of suitable and appropriate order parameters. For our choice of the order parameters, 
we were guided by previous numerical simulations of Met-enkephalin |2l|,[54j|-||41||. Using 
the ECEPP/2 force field |42 |, it was shown in a recent article [^9| that Met-enkephalin 
undergoes a transition between extended and compact structures at a temperature Tg = 
295±20 K. Above that temperature, the frequency of compact structures rapidly decreases 
while it increases below Tg. Hence, our first order parameter is the volume allowing us to 
distinguish between compact and extended conformations. In Ref. |2{| it was also shown 
that by further lowering the temperature the peptide encounters a second transition. 
Below Tf = 230 ± 30 K, the occupation of the ground-state conformations increases 
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rapidly while it decreases for values of T above Tf. The ensemble of low-temperature 
conformations was studied by various groups |23|,@]-[]33,[|3l§ and the numerical results 



compared with the experimental findings [3S]. These studies agree in that there are two 



major groups of well-defined compact structures which are characterized (and stabilized) 
by specific hydrogen bonding patterns. In Fig. 1 we show a sketch of the two structures. 
Structure A is the ground-state conformation in ECEEP/2 and has a Type II' /3-turn 
between the second and last residue, stabilized by two possible hydrogen bonds. The 
structure B, the second-lowest energy state, is characterized by hydrogen bond between 
Tyr-1 and Phe-4 resulting in a Type-II /3-turn between the first and fourth residue. We 
remark that at higher energies (in ECEPP more than 2 kcal/mol above the ground state) 
conformations with a 7-turn and those with hydrogen bonding between the first and last 
residue were also observed 0, f4lfl . Hence, we choose as further order parameters the 
overlap with the ground state (structure A) and the second-lowest-energy state (structure 
B), respectively, which allows us to distinguish between the various compact low-energy 
conformations. 

In the following sections we first review our simulation method and the ECEPP force 
field and show afterwards how the above order parameters can be defined and measured. 
Our results are then presented and we finish with our conclusions. 

SIMULATION TECHNIQUES 



The generalized-ensemble technique utilized in this article was first introduced in Refs. [43 



44]. In this algorithm, configurations are updated according to the following probability 
weight: 

where E is an estimator for the ground-state energy, np is the number of degrees of 
freedom of the system, and (3 = is the inverse temperature (ks is the Boltzmann 

constant and T the temperature of the system). Note that this weight is a special case of 
the weights used in Tsallis generalized mechanics formalism [f|5[| (the Tsallis parameter q 
is chosen as q = 1 + 1/np). In the low-energy region (where ^ E ~^ <C 1), the new weight 
reduces to the canonical Boltzmann weight exp(—/3E). On the other hand, high-energy 
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regions are no longer exponentially suppressed but only according to a power law, which 
enhances excursions to high-energy regions. We remark that the calculation of the weight 
is much easier than in other generalized-ensemble techniques, since it requires one to find 
only an estimator for the ground-state energy E , which can be done by a procedure 



described in Ref. 44 



As in the case of other generalized ensembles, we can use the reweighting techniques 



4b[ to construct canonical distributions at various temperatures. This is because the 
simulation by the present algorithm samples a large range of energies. The thermodynamic 
average of any physical quantity A can be calculated over a wide temperature range by 



dx A(x) w-\E{x)) e- pE{x) 

<A> T = J — r , (2) 

dx w-\E{x)) e~^ x) 



where x stands for configurations. 



FORCE FIELDS 



For our simulations we used the ECEEP/2 force field in which the potential energy 
function E tot is given by the sum of the electrostatic term Ec, 12-6 Lennard- Jones term 
Elj, and hydrogen-bond term Ehb for all pairs of atoms in the peptide together with the 
torsion term E tor for all torsion angles: 

E t ot = E c + E LJ + E HB + E tor , (3) 

Ec = (4) 

(id) en i 

- E (| - 1) • (5) 

E H b = Z)(d! ( 6 ) 

E tor = £l/,(l±cos(n,Xi))- (7) 
i 

Here, (in A) is the distance between the atoms % and j, and xi is the torsion angle 
for the chemical bond /. Bond lengths and bond angles are fixed at experimental values, 
leaving the dihedral angles 0, if), u, and \ as independent variables. We further fix the 
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peptide bond angles u to their common value 180°, which leaves us with 19 torsion angles 
(0, if), an d x) as independent degrees of freedom (i.e., np = 19). In our simulations we did 
not explicitly include the interaction of the peptide with the solvent and set the dielectric 
constant e equal to 2. However, we do expect some implicit solvent effect, since the vari- 
ous parameters (qi,Aij,Bij,Cij,Dij,Ui, and n{) for the energy function were determined 
by minimization of the potential energies of the crystal lattices of single amino acids, i.e., 
not in a vacuum. We remark that the computer code KONF90 ^7] which we used in our 
simulation relies on a different convention for the implementation of the ECEPP param- 
eters (for example, 0i of ECEPP/2 is equal to 0i - 180° of KONF90). Therefore, our 
energy values are slightly different from those of the original implementation of ECEPP/2. 



TECHNICAL DETAILS 

It is known from our previous work that the ground-state conformation for Met-enkephalin 
has the KONF90 energy value E GS = -12.2 kcal/mol |7|. We therefore set E = -12.2 
kcal/mol, T = 50 K (or, /3 = 10.1 [ kcal / mol ]) and np = 19 in our probability weight factor in 
Eq. (|1|). Our simulation was started from a completely random initial conformation (Hot 
Start) and one Monte Carlo sweep updates every torsion angle of the peptide once. All 
thermodynamic quantities were then calculated from a single production run of 1,000,000 
MC sweeps which followed 10,000 sweeps for thermalization. At the end of every fourth 
sweep we stored the energies of the conformation and our three "order parameters" (the 
corresponding volume, the overlap Oa of the conformation with the (known) ground state 
(structure A) and the overlap Op of the conformation with conformer B. Here, we approx- 
imate the volume of the peptide by its solvent excluded volume (in A 3 ) which is calculated 



by a variant |^8| of the double cubic lattice method J49j. Our definition of the overlap, 
which measures how much a given conformation resembles a reference state, is given by 



up 

\aY' — a) 



90 n F iz _ , 

where af and a\ RSS) (in degrees) stand for the np dihedral angles of the conformation at 
t-th Monte Carlo sweep and the reference state conformation, respectively. Symmetries 
for the side-chain angles were taken into account and the difference af — a\ RSS) was always 
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projected into the interval [—180°, 180°]. Our definition guarantees that we have 

< < O > T < 1 . (9) 

We remark that the average overlap < O >t approaches its limiting value zero (for 
T — > oo) only very slowly as the temperature increases. For instance, at T = 1000 K we 
found for the overlap with the ground state still has an average value of < Oa >~ 0.3. 
This is because < O >t = corresponds to a completely random distribution of dihedral 
angles which is energetically highly unfavorable due to the steric hindrance of both main 
and side chains. Note the obvious limit: Oa — > 1, as T — > 0. 

Since large parts of the configuration space are sampled by our method, the use of the 
reweighting techniques |46| is justified to calculate thermodynamic quantities over a wide 



range of temperatures by Eq. (|2]). Examples are the average potential energy < E > (V), 
the entropy 

S(V)=<E> (V)/k B T + log P(V) (10) 

(where P(V) is the probability to find a conformation with the volume V), and similar 
quantities defined as functions of the two overlaps Oa and Ob- We normalized all the 
above quantities in such a way that they are zero for the ground state. 

The above defined quantities allow only to study a projection of the energy land- 
scape into one dimension. To get a more detailed picture, we also explored for various 
temperatures the free energies 

G(O a ,O b ) = -k B T\ogP(0 A ,0 B ) (11) 

and 

G(O a , V) = -k B T\ogP(0 A , V) , (12) 

where again P(Oa, O b ) and P(0 A , V) are respectively the probabilities to find a peptide 
conformation with values A , B and Oa, V. We chose the normalization so that the 
lowest value of G(0 A , O b ) or G(O a , V) is set to zero for each temperature. 

Finally, in order to monitor the number of states in which the peptide can be found 
at temperature T, we also calculated the (unnormalized) entropy 

S(T) =< E > (T)/k B T - log Z(T) , (13) 



where Z(T) is the estimate of the partition function of the system as calculated from our 
data by the reweighting techniques. This quantity allows one to estimate the glass tem- 
perature T g , since the number of possible states should decrease drastically after entering 
the glassy phase. 



THE MET-ENKEPHALIN FUNNEL LANDSCAPE 

The energy landscape for a folding protein strongly depends on temperature. For some 
temperatures non-specific trapping may not be a problem but the situation may com- 
pletely reverse for other temperatures. Therefore, when exploring the folding landscape 
for this peptide, the simulations should take place at relevant temperatures, for otherwise 
one is not able to obtain a reliable picture of the folding mechanism. Hence, we have 
concentrated our analyses on four temperatures. The first one, T = 1000 K was chosen 
to probe the high-temperature regime where the peptide is fully unfolded and mostly in 



an extended form. In some early work ||29|| , some of us have identified T = 300 K as the 
collapse temperature Tg and T = 230 K as the folding temperature Tf. The last tempera- 
ture, T = 150 K, was chosen to study the low temperature behavior of the peptide where 
the glassy behhavior is observed. 

Fig. 2a displays the average potential energy < E > (V) as a function of the volume 
of the peptide for the four chosen temperatures. The plot shows that configurations 
with small volume (< 1400 A 3 ) have essentially the same energy. Above that value of 
the volume the energy increases with the volume. The increase is only gradual for high 
temperatures, but very steep below Tg. Similar results were found for the entropy S(V), 
displayed for the same four temperatures in Fig. 2b. The number of states varies slightly 
below a certain threshold but increases rapidly above that value. The steepness of that 
increase is again a function of temperature below Tg. The two plots indicate that one 
can distinguish between two regimes: compact structures which have similar energies 
and entropy and extended structures which are entropically favored but energetically 
disfavored. 

Fig. 3a (3b) shows the average energy (entropy) as a function of the overlap with 
structure A (the ground state in ECEPP/2). For the interpretation of the plots one has 
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to keep in mind that the overlaps approach zero very slowly with increasing energy or 
temperature. Results from our simulation are only trustworthy for temperatures T < 1000 
K where the overlap function has average value of ~ 0.3. Hence, in Figs. 3a and 3b the 
results are only reliable for values of the overlap function above 0.3. 

At high temperatures, the dependence of < E > (Oa) on the overlap Oa is essentially 
a monotonous function where the energy decreases with increasing value of the overlap. 
Below the collapse temperature Tg, however, the curves show a minimum at Oa ~ 0.5. 
This is the value Oa of Conformation B (the exact value being 0.46), and this minimum 
is therefore an indication of a basin of attraction to Conformer B. However, the overall 
slope of < E > (Oa) shows that Conformer A is favored. On the other hand, the entropy 
as a function of the overlap strongly varies with decreasing overlap for all but the lowest 
temperature. The absolute values of the entropy decreases with temperature. At T = 150 
K, however, S(Oa) stays essentially constant over the whole range of values of the overlap 
once this value is only a little different from 1. This means that at this temperature 
the number of states differs little with the overlap and therefore indicates the onset of 
glassy behavior. Similar pictures hold when one prints energy and entropy differences as 
function of the overlap with Conformation B. This kind of behavior has been predicted 
from simulations of minimalist models and now they have been confirmed for simulations 
of a real peptide. 

As the number of available states gets reduced with the decrease of temperature, 
the possibility of local trapping increases substantially. In the thermodynamic limit, the 
system would be trapped in these local traps for an infinite time as expected for a glass 
transition. To determine the glassy behavior for a finite system is a more subtle problem. 
In the folding problem, glassy behavior is associated when the residence time in some 
local traps becomes of the order of the folding event. Folding dynamics is now non- 
exponential since different traps have different escape times |5H| , I3|. For temperatures 



above the glass transition temperature T„, the folding dynamics is exponential and a 



configurational diffusion coefficient average the effects of the short lived traps |TQ, m], 52 



It is expected that for a good folder the temperature, T g , where glass behavior sets in, 
has to be significantly lower than the folding temperature Ty, i.e. a good folder can be 
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characterized by the relation pj 

^ > 1 ■ (14) 

Our plots of AS(Oa) indicate that we indeed have T g < Tf. This is also supported by 
Fig. 4 where the (unnormalized) entropy of the molecule is represented as a function of 
temperature. This plot clearly shows the rapid decrease of entropy once T reaches values 
smaller than T g . As one would expect from such a small molecule, it is impossible to 
determine a precise value for T g from the plot, but clearly the onset of glassy behavior 
happens at temperatures well below the folding temperature Tf = 230 K. In Ref. [29] it 



was already pointed out that these results are also consistent with an alternative charac- 
terization of folding properties. Thirumalai and collaborators |5^, |54| have conjectured 
that the kinetic accessibility of the native conformation can be classified by the parameter 

*=^, (15) 

i.e., the smaller a is, the more easily a protein can fold. With our values for Tg and Tf, 
we have for Met-enkephalin o « 0.2. Here, we have taken the central values: Tg = 295 
K and Tf = 230 K. This value of a implies that our peptide has reasonably good folding 
properties according to Refs. |53| and ||54j| . Hence, we see that there is a strong correlation 
between the folding criterion (Tf/T g > 1) proposed by Bryngelson and Wolynes || and 
this one. 

The main goal of this article is to depict the folding funnel of Met-enkephalin. Since it 
is not feasible to plot the free energy G as a function of all three order parameters, one has 
to plot G as a function of a suitable combination of the three relevant order parameters 
of the molecule. We chose to plot the free energy G(V, Oa) as a function of volume V and 
overlap Oa with the known ground state (Conformer A) and G(Oa, Ob) as a function of 
the overlap with the ground state {Oa) and with Conformer B (Ob)- Again we study 
these quantities for temperatures T = 1000 K, T = T e = 300 K, T = Tf = 230 K, and 
T = 150 K. They are shown in Fig. 5a-d and Fig. 6a-d, respectively. 

In Fig. 5a we show the free energy landscape as a function of volume and overlap 
with the known ground state (structure A) at the high-temperature region (T = 1000 
K). Here, (as in the other free energy plots) we normalized the free energy in such a 
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way that its observed minimum is set to zero. In the contour plots, the contour lines 
mark multiples of fcgT (therefore different for different temperatures but appropriate to 
understand the folding mechanism). We see that the free energy has its minimum at 
large volumes (~ 1470 A 3 ) and values of the overlap Oa ~ 0.3. Small volumes and larger 
values of the overlap are suppressed by many orders of ksT . Hence, extended random 
coil structures are favored at this temperature. The picture changes dramatically once 
we reach the collapse temperature Tg, shown in Fig. 5b. At this temperature a large 
part of the V-Oa space can be sampled in a simulation. The contour plot shows that 
regions with both small and large volumes and almost all values of Oa lie within the 2 
ksT contour. This indicates that at this temperature the cross over between extended 
and compact structures happens with a small thermodynamic barrier between them. By 



lowering the temperature to Tj = 230K (determined in ref. [29]), we now observe strong 
evidence for a funnel-like landscape (Fig. 5c). At this temperature the drive towards 
the native configuration is dominant and no long-lived traps exist. There is clearly a 
gradient towards the ground-state structure (Oa ~ 1), but other structures with similar 
volume (characterized by values of Oa ~ 0.5) are only separated by free energy barriers 
of order 1 ksT. Below this temperature we expect that the ground state is clearly favored 
thermodynamically and separated from other low energy states by free energy barriers of 
many orders of ksT. This can be seen in Fig. 5d where at T = 150i^ where other low 
energy states have free energies of 3 /cgT higher than the ground state and are separated 
by an additional barrier of 2 ksT. 

The above picture is supported by the plots for the free energy as a function of both 
the overlap Oa with the ground state and the overlap Ob with structure B. Fig. 6a shows 
again the high-temperature situation. The free energy has its minimum at small values 
of the overlap indicating that both conformers appear with only very small frequency at 
high temperature. At T = 300i^, the collapse temperature, again a large part of the 
space of possible configurations (characterized by values of Oa and Ob) lies within the 
2fcgT contour as is clear from Fig. 6b. At the folding temperature Tf = 230 K a funnel 
in the energy landscape appears with a gradient towards the ground state, but Fig. 6c 
shows that there are various other structures, the most notable of which is Conformer 
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B (where Ob ~ 1), with free energies 3 ksT higher than the ground-state conformation 
but separated from each other and the ground state only by free energy barriers less than 
1 ksT. No other long-lived traps are populated. Hence, the funnel at Tf is reasonably 
smooth. Folding routes include direct conversion from random-coil conformations into 
Conformer A or some short trapping in Conformer B region before reaching Conformer 
A region, but at the folding temperature it is possible to reach the ground state from any 
configuration without getting kinetically trapped. Kinetic Monte Carlo runs at a fixed 
temperature (T = 230 K) were performed and confirmed this picture (data not shown). 
We observed that some of the runs went directly from the unfolded state to the ensemble 
of folded conformations in state A, while in other runs trapping at state B occurred first 
before folding into the ground-state structure. The kinetic runs therefore support our 
observation that Met-enkephalin is a good folder and that Tf > T g . Fig. 6d shows the 
situation for T = 150 K where we expect onset of glassy behavior. Again one sees a 
funnel-like bias toward the ground state, however, the funnel is no longer smooth and the 
free energy landscape is rugged. Free energy barriers of many ksT now separate differ- 
ent regions and would act as long-lived kinetic traps in a canonical simulation rendering 
folding at this temperature extremely difficult. 

CONCLUSIONS 

In this article we have studied the free energy landscape of the peptide Met-enkephalin, 
using generalized-ensemble techniques. Although the peptide is rather small, the obtained 
free energy landscape shows a funnel towards the ground state even for temperatures well 
below the folding temperature Tf. It was shown that glassy behavior appears only well 
below this temperature and that the peptide is a good folder at Tf. Our results demon- 
strate that the energy landscape picture and funnel concept of folding that have been 
demonstrated in the past for minimalist models are much more general and hold for this 
peptide when simulated with an all-atom force field. 

Acknowledgments: 



13 



Our simulations were performed on computers of the Institute for Molecular Science 
(IMS), Okazaki, Japan. This work is supported by a Grant-in- Aid for Scientific Research 
from the Japanese Ministry of Education, Science, Sports and Culture. Work in San 
Diego was also supported by the National Science Foundation (Grant # 96-03839). Work 
in Houghton was supported by funds from Michigan Technological University. JNO was 
a visitor at the IMS when part of this work was performed and he thanks Y. Tanimura 
for his hospitality during his stay in Japan. 

References 

[1] Anfinsen, C. B. Principles that govern the folding of protein chains. Science 181: 
223-230, 1973. 

[2] Dill, K. A., Chan, H. S. From Levinthal to pathways to funnels. Nature Structural 
Biology 4: 10-19, 1997. 

[3] Sali, A., Shakhnovich, E.I., Karplus, M. Kinetics of Protein Folding — A Lattice 
Model Study of the Requirements for Folding to the Native State. J. Mol. Biol. 
235: 1614-1636, 1994. 

[4] Guo, Zh., Thirumalai, D. Kinetics and Thermodynamics of Folding of a de Novo 
Designed Four-helix Bundle Protein. J. Mol. Biol. 236: 323-343, 1996. 

[5] Bryngelson, J.D., Wolynes, P.G. Spin glasses and the statistical mechanics of pro- 
tein folding. Proc. Natl. Acad. Sci. (USA) 84: 7524-7528, 1987. 

[6] Leopold, P. E., Montal, M., Onuchic, J. N. Protein folding funnels: Kinetic path- 
ways through compact conformational space. Proc. Natl. Acad. Sci. (USA) 89: 
8721-8725, 1992. 

[7] Onuchic, J. N., Luhey-Schulten, Z., Wolynes, P.G. Theory of protein folding: the 
energy landscape perspective. Annu. Rev. Phys. Chem. 48: 545-600, 1997. 



14 



[8] Bryngelson, J. D., Onuchic, J. N., Socci, N. D., Wolynes, P. G. Funnels, Pathways 
and the Energy Landscape of Protein Folding: A Synthesis. Proteins 21: 167-195, 
1995. 

[9] Onuchic, J. N., Wolynes, P. G., Luthey-Schulten, Z., Socci, N. D. Towards an 
Outline of the Topography of a Realistic Protein Folding Funnel. Proc. Natl. Acad. 
Sci. (USA) 92: 3626-3630, 1995. 

[10] Socci, N. D., Onuchic, J. N., Wolynes, P. G. Diffusive Dynamics of the Reaction 
Coordinate for Protein Folding Funnels. J. Chem. Phys. 104: 5860-5871, 1996. 

[11] Guo, Zh., Thirumalai, D. Kinetics of Protein Folding - Nucleation Mechanism, 
Time Scales, and Pathways. Biopolymers 36: 103-116, 1995. 

[12] Mirny, L.A., Abkevich, V., Shakhnovich, E.I. Universality and Diversity of the 
Protein Folding Scenarios: A comprehensive analysis with the aid of a lattice model. 
Fold. & Des. 1: 103-116, 1996. 

[13] Friedrichs, M.S., Goldstein, R.A., Wolynes, P.G. Generalized protein tertiary struc- 
ture recognition using associative memory hamiltonians. J. Mol. Biol. 222: 1013— 
1034, 1991. 

[14] Guo, Zh., Brooks III, C.L. Thermodynamics of Protein Folding: A Statistical Me- 
chanical Study of a Small All (3 Protein. Biopolymers 42: 745-757, 1997. 

[15] Nymeyer, H., Garcia, A.E., Onuchic, J.N. Folding Funnels and Frustration in Off- 
Lattice Minimalist Protein Landscapes. Proc. Natl. Acad. Sci. (USA) 95: 5921- 
5928, 1998. 

[16] Shea, J-E, Nochomovitz, Y.D, Guo, Zh, Brooks III, C. L. Exploring the Space 
of Protein Folding Hamiltonians: The Balance of Forces in a Minimalist /5-Barrel 
Model, submitted to J. Chem. Phys., 1998. 

[17] Hardin, C, Luthey-Schulten, Z.A., Wolynes, P.G. Backbone dynamics, fast folding, 
and secondary structure of helical proteins and peptides, submitted to Proteins, 
1998. 

15 



[18] Wolynes, P.G., Luthey-Schulten, Z.A., Onuchic, J.N. Fast-folding experiments and 
the topography of protein folding energy landscapes. Chemistry & Biology 3: 425- 
432, 1996. 

[19] Vasquez, M., Nemethy, G., Scheraga, H. A. Conformational Energy Calculations 
of Polypeptides and Proteins. Chem. Rev. 94: 2183-2239, 1994. 

[20] Hansmann, U.H.E., Okamoto, Y. The Generalized- Ensemble Approach for Protein 
Folding Simulations, to appear in: Stauffer, D. (ed.) 'Annual Reviews in Compu- 
tational Physics". Singapor: World Scientific 1998. 

[21] Berg, B. A., Neuhaus, T. Multicanonical algorithms for first order phase transitions. 
Phys. Lett. 267: 249-253, 1991. 

[22] Lyubartsev, A. P., Martinovski, A. A., Shevkunov, S. V., Vorontsov-Velyaminov, 
P. N. New approach to Monte Carlo calculations of the free energy: Method of 
expanded ensembles. J. Chem. Phys. 96: 1776-1783, 1992; Marinari, E., Parisi, G. 
Simulated Tempering: A new Monte Carlo Scheme. Europhys. Lett. 19: 451-458, 
1992. 

[23] Hansmann, U. H. E., Okamoto, Y. Prediction of Peptide Conformation by Multi- 
canonical Algorithm: A New Approach to the Multiple-Minima Problem. J. Comp. 
Chem. 14: 1333-1338, 1993. 

[24] Okamoto, Y., Hansmann, U. H. E. Thermodynamics of Helix - Coil Transitions 
Studied by Multicanonical Algorithms. J. Phys. Chem. 99: 11276-11287, 1995. 

[25] Hansmann, U. H. E., Okamoto, Y. Finite-size scaling of helix-coil transitions in 
poly-alanine studied by multicanonical simulations, to appear in J. Chem. Phys. 

[26] Hansmann, U.H.E., Okamoto, Y. Tertiary structure prediction of C-peptide of 
ribonuclease A by multicanonical algorithm. J. Phys. Chem. 102: 653-656, 1998. 

[27] Hansmann, U.H.E., Okamoto, Y. Effects of side-chain charges on a-helix stability 
in C-peptide of ribonuclease A studied by multicanonical algorithm, submitted for 
publication. 

16 



[28] Hansmann, U. H. E., Okamoto, Y. Numerical Comparisons of Three Recently Pro- 
posed Algorithms in the Protein Folding Problem. J. Comp. Chem. 18: 920-933, 
1997. 

[29] Hansmann, U.H.E., Masuya, M., Okamoto, Y. Characteristic Temperatures of Fold- 
ing of a Small Peptide. Proc. Natl. Acad. Sci. (U.S.A.) 94: 10652-10656, 1997. 

[30] Boczko, E. M. & Brooks, C. L. III. First principles calculation of the folding free 
energy for a three helix bundle protein. Science 269: 393-396, 1995. 

[31] Aubry, A., Birlirakis, N., Sakarellos-Daitsiotis, M., Sakarellos, C. A Crystal Molec- 
ular Conformation of Leu-enkephalin related to the Morphin Molecule. Biopolymers 
28: 27 - 40, 1989. 

[32] Behnam, B.A., Deber, CM. Evidence for a Folded Conformation of Methionine- 
and Leucine-Enkephalin in a Membrane Enviroment. J. Biol. Chem. 259: 14939 - 
14940, 1984. 

[33] Karayannis, T., Gerothanassis, LP., Sakarellos-Daitsiotis, M., Sakarellos, C, Mar- 
raud, M. 17 and 14 N-NMR Studies of Leu-Enkephalin and Enkephalin Related 
Fragments in Aqueous Solution. Biopolymers 29: 423 - 439, 1990. 

[34] Li, Z., Scheraga, H.A. Monte Carlo Minimization approach to the multiple-minima 
problem in protein folding. Proc. Nat. Acad. Sci. (U.S.A.) 84: 6611 - 6615, 1984. 

[35] Freyberg, B. von, Braun, W. Efficient search for all low energy conformations of 
polypeptides by Monte Carlo methods. J. Comp. Chem. 12: 1065-1076, 1991. 

[36] Okamoto, Y., Kikuchi, T., Kawai, H. Prediction of low-energy structures of Met- 
enkephalin by Monte Carlo simulated annealing. Chem. Lett. 1992: 1275-1278, 
1992. 

[37] Hansmann, U. H. E., Okamoto, Y. Comparative Study of Multicanonical and Sim- 
ulated Annealing Algorithms in the Protein Folding Problem. Physica A 212: 415- 
437, 1994. 



17 



[38] Montcalm, T., Cui, W., Zhao, H., Guarnieri, F., Wilson, S.R. Simulated anneal- 
ing of met-enkephalin: low energy states and their relevance to membrane-bound, 
solution and solid-state conformations. J. Mol. Struc. (Theochem) 308: 37-51, 1994. 

[39] Eisenmenger, F., Hansmann, U.H.E., Variation of the Energy Landscape of a 
Small Peptide under a Change from the ECEPP/2 Force Field to ECEPP/3. 
J. Phys. Chem. B 101: 3304-3310, 1997. 

[40] Hansmann, U.H.E. Parallel Tempering Algorithm for Conformational Studies of 
Biological Molecules. Chem. Phys. Lett. 281: 140-150, 1997. 

[41] Mitsutake, A., Hansmann, U. H. E., Okamoto, Y., in preparation. 

[42] Sippl, M. J., Nemethy, C, Scheraga, H. A. Intermolecular potentials from crystal 
data. 6. Determination of empirical potentials for O-H- • -0=C hydrogen bonds 
from packing configurations. J. Phys. Chem. 88: 6231-6233, 1994; and references 
therein. 

[43] Hansmann, U. H. E. Simulated Annealing with Tsallis Weights - A numerical Com- 
parison. Physica A 242: 250-257, 1997. 

[44] Hansmann, U. H. E., Okamoto, Y. Generalized-Ensemble Monte Carlo Method for 
Systems with Rough Energy Landscape. Physical Review E 56: 2228 - 2233, 1997. 

[45] Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 52: 
479-487, 1988. 

[46] Ferrenberg, A. M., Swendsen, R. H. Optimized Monte Carlo data analysis. 
Phys. Rev. Lett. 61: 2635-2638, 1988; Phys. Rev. Lett. 63: 1658-1658(E), 1989; 
and references given in the erratum. 

[47] Kawai, H., Okamoto, Y., Fukugita, M., Nakazawa, T., Kikuchi, T. Prediction 
of a-helix folding of isolated C-peptide of ribonuclease A by Monte Carlo simu- 
lated annealing. Chem. Lett. 1991: 213-216, 1991; Okamoto, Y., Fukugita, M., 
Nakazawa, T. Kawai, H. a-helix folding by Monte Carlo simulated annealing in 
isolated C-peptide of ribonuclease A. Protein Engineering 4: 639-647, 1991. 

18 



[48] The program for the calculation of solvent excluded volume was written by M. Ma- 
suya and will be described in detail elsewhere. 

[49] Eisenhaber, F., Lijnzaad, P., Argos, P., Sander, C, Scharf, M. The Double Cubic 
Lattice Method: Efficient Approaches to Numerical Integration of Surface Area and 
Volume and to Dot Surface Contouring of Molecular Assmblies. J. Comp. Chem. 
16: 273-284, 1995. 

[50] Socci, N.D., Onuchic, J.N., Wolynes, P.G. Protein Folding Mechanisms and the 
Multidimensional Folding Funnel. Proteins 32:136:158 (1998). 

[51] Scalley, M.L. & Baker, D. Protein folding kinetics exhibit an Arrhenius temperature 
dependence when corrected for the temperature dependence of protein stability. 
Proc. Nat. Acad. Sci. (USA) 44: 10636-10640, 1997. 

[52] Plotkin, S.S., Wolynes, P.G. Non-Markovian Configurational Diffusion and Reac- 
tion Coordinates in Protein Folding. Phys. Rev. Let. 22: 5015-5018, 1998. 

[53] Camacho, C.J., Thirumalai, D. A Criterion That Determines Fast Folding of Pro- 
teins: A Model Study. Proc. Natl. Acad. Sci. USA 90: 6369-6372, 1993. 

[54] Klimov, D. K., Thirumalai, D. Factors Governing the Foldability of Proteins. Pro- 
teins 26: 411-441 (1996). 

[55] Sayle, R.A., Milner- White, E.J. RasMol: Biomolecular graphics for all. TIBS 20: 
374-376 1995. 



19 



Figures: 



Fig. 1: Backbone structures of the two dominant low-energy structures with their 



characteristic hydrogen bonding. The figures were created with RasMol ]5q| . Fig. la 
displays conformer A, the ground state in ECEPP/2 (with a KONF90-energy of 
—12.2 kcal/mol). Conformer B in Fig. lb is the local minimum with the second 
lowest potential energy (—11.0 kcal/mol in KONF90). 

Fig. 2: a) Potential energy < E > (V) (in kcal/mol) and b) entropy S(V) (as 
defined in Eq. |X0]) as a function of the volume (in A 3 ) of the peptide for various 
temperatures (in K). We chose units where for the ground state < E > (V) = 
and S(V) = 0. 

Fig. 3: a) Potential energy < E > (Oa) (in kcal/mol) and b) entropy S(Oa) as a 
function of the overlap Oa (defined in the text) for various temperatures (in K). We 
chose units where for the ground state < E > (Oa) = and S(Oa) = 0. 

Fig. 4: Unnormalized entropy S(T) (as defined in Eq. |T3|) as a function of temper- 
ature (in K). 

Fig. 5: Free energy G(V, Oa) (in kcal/mol) as a function of both peptide volume V 
(in A 3 ) and overlap Oa (as defined in the text) for a) T = 1000 K, b) T = 300 K, 
c) T = 230 K, and d) T = 150 K. Both the free energy surface and the contour plot 
are shown. The contour lines are multiples of ksT. G(V,Oa) was normalized such 
that mm(G(V,0 A )) = 0. 

Fig. 6: Free energy G(Oa, Ob) as a function of both overlaps Oa and Ob (as defined 
in the text) for a) T = 1000 K, b) T = 300 K, c) T = 230 K, and d) T = 150 K. 
Both the free energy surface and the contour plot are shown. The contour lines are 
multiples of ksT. G(Oa,Ob) was normalized such that min(G(OA, Ob)) = 0. 



20 



f = 1000K 




15 



10 







T = 

T: 
T: 
T: 



1000K 
= 300K 
= 230K 
= 150K 



+ 
x 

X 

□ 



□ 



□ 



□ 



□ 



x x : 
x xxi*x 



x 

x vX x 



*J**xxx 



X 



.++++++ 



***** 



,x>$ ++ xx * 



□ 



□ 




1 320 1 340 1 360 1 380 1 400 1 420 1 440 1 



Volume V 



A 
< 

g 

LU 

V 



25 + - 



20 - 



15 - 



10 * 



5 - 







□ 



+ 



x x 



□ n 



+ 



+ 



+ 



x X X X x 



X X * 



X- 



□ □ □ □ 



T = 
T = 
T = 
T = 

+ 



10i 

3i 
2: 

1! 



+ 



X 



X 



□ 



X 



X 



□ 



X 

X 

□ □ 
l_ 



0.3 0.4 0.5 0.6 0.7 0.6 

Overlap 0_A 



< 

o 

c/5 



45 

40 : 
35 - 
30 - 
25 
20 I 
15 - 

□ 

10 - 

5 - 




+ 



x 



□ □ 



+ 



x X 



□ □ 



+ 



+ 



xxx 



* X x x * * 



□ 



□ 



T = 
T = 
T = 
T = 



10i 

3i 
2: 
1; 



+ 



X 



X 



X 



X 



X 



X 



□ 



□ □ 



X 



X 



□ 



0.3 0.4 0.5 0.6 0.7 0.6 

Overlap 0_A 



CO 



50 - 



- 



-50 - 



-100 - 



-150 



+ 



+ 







.+++" 



.+++' 



.++++" 



. ++ +++ +++ ^" 



+ 



, + ' 



.+ 



+ 



Entropy S(T) = <E>/(k*T) 



200 400 



600 



T[K] 



T = 1000 K 

G(0_A,V) 




G(0_A,V) 



T = 300 K 




G(0_A,V) 



T = 230 K 




G(0_A,V) 



T = 150 K 




T = 1000 K 



G(0_A,0_B) 




T = 300 K 




T = 230 K 




T = 150 K 




